Effective Speaker Tracking Strategies for Multi-party Human-Computer Dialogue
Abstract
Human-computer dialogue is a rather mature research field [10] that has already led to several commercial applications, either service- or task-oriented [11]. Nevertheless, several issues remain to be tackled where unrestricted, spontaneous dialogue is concerned. Barge-in (when users interrupt the system or each other) must be handled properly, which makes Voice Activity Detection a crucial component [13]. Moreover, when multi-party interactions are allowed (i.e., the machine engages in dialogue with several users simultaneously), additional robustness constraints arise: the speakers have to be tracked properly, so that each utterance is mapped to the speaker who produced it. This is needed for a reliable analysis of input utterances [2]. Spoken human-computer dialogue systems can be seen as advanced applications of spoken language technology. A dialogue system provides a voice-based, relatively natural interface between the user and a software application. Thus, spoken dialogue systems subsume most of the fields in spoken language technology, including speech recognition and synthesis, natural language processing, and dialogue management (planning). A dialogue system integrates several components, which generally provide the following functions [3]:
Related resources
Multi-Party Quantum Dialogue with the Capability to Expand the Number of Users at Runtime
Quantum dialogue is a type of quantum communication in which users can simultaneously send messages to each other. The earliest instances of quantum dialogue protocols faced security problems such as information leakage and were vulnerable to intercept and resend attacks. Therefore, several protocols have been presented that try to solve these defects. Despite these improvements, the quantum di...
Exploring the Characteristics of Multi-Party Dialogues
This paper describes novel results on the characteristics of three-party dialogues by quantitatively comparing them with those of two-party dialogues. Previous dialogue research has focused mainly on two-party dialogues because data collection for multi-party dialogues is difficult and there are very few theories handling them, although research on multi-party dialogues is expected to be of much use in ...
Rhetorical Control of Semantic Ellipsis in Language Generation for Multi-Party Human-Computer Dialogue
This paper addresses semantic ellipsis control in language generation for human-computer dialogue. For this, the rhetorical structure of the dialogue is used, as well as a repository of facts that are accepted by the recipient of the utterance. Rhetorical structure is represented in the framework of SDRT, where rhetorical relations are grouped into “confirmation” and “contradiction” relations; the...
A Rhetorical Structuring Model for Natural Language Generation in Human-Computer Multi-Party Dialogue
Multi-party human-computer dialogue research is still in its infancy. Most of the research in this respect either addresses dialogues between pairs of computers, or performs studies on multi-party human dialogue corpora, in order to better understand this type of interaction. Thus, there are only a few computational models for this type of linguistic interaction and this paper tries to fill thi...
Chapter 4 EXPERIENCES OF MULTI-SPEAKER DIALOGUE SYSTEM FOR VEHICULAR INFORMATION RETRIEVAL
Currently, most spoken dialogue systems only deal with the interaction between the system and one speaker. In some situations, interactions may occur between several speakers and the system. New functions and improvements are needed in order to handle a multi-user situation. Studies of human-computer interaction systems that involve multiple users are in their initial stages and any pap...